Overview
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 29165 |
| Missing cells | 9027 |
| Missing cells (%) | 1.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 16.4 MiB |
| Average record size in memory | 590.8 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 11 |
| Boolean | 2 |
Has a mobile phone has constant value "1" | Constant |
Account age_x is highly overall correlated with Account age_y | High correlation |
Account age_y is highly overall correlated with Account age_x | High correlation |
Children count is highly overall correlated with Family member count | High correlation |
Employment length is highly overall correlated with Employment status and 1 other fields | High correlation |
Employment status is highly overall correlated with Employment length | High correlation |
Family member count is highly overall correlated with Children count | High correlation |
Gender is highly overall correlated with Job title | High correlation |
Job title is highly overall correlated with Employment length and 1 other fields | High correlation |
Education level is highly imbalanced (50.6%) | Imbalance |
Dwelling is highly imbalanced (73.3%) | Imbalance |
Has an email is highly imbalanced (56.3%) | Imbalance |
Is high risk is highly imbalanced (87.5%) | Imbalance |
Job title has 9027 (31.0%) missing values | Missing |
ID has unique values | Unique |
Children count has 20143 (69.1%) zeros | Zeros |
Reproduction
| Analysis started | 2026-01-05 18:40:36.854764 |
|---|---|
| Analysis finished | 2026-01-05 18:40:49.974365 |
| Duration | 13.12 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
ID
Real number (ℝ)
Unique
| Distinct | 29165 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5078231.6 |
| Minimum | 5008804 |
|---|---|
| Maximum | 5150485 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | 5008804 |
|---|---|
| 5-th percentile | 5018455.4 |
| Q1 | 5042047 |
| median | 5074666 |
| Q3 | 5114629 |
| 95-th percentile | 5146012.8 |
| Maximum | 5150485 |
| Range | 141681 |
| Interquartile range (IQR) | 72582 |
Descriptive statistics
| Standard deviation | 41824.001 |
|---|---|
| Coefficient of variation (CV) | 0.008235938 |
| Kurtosis | -1.2095593 |
| Mean | 5078231.6 |
| Median Absolute Deviation (MAD) | 38051 |
| Skewness | 0.084511077 |
| Sum | 1.4810662 × 1011 |
| Variance | 1.749247 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5037048 | 1 | < 0.1% |
| 5044630 | 1 | < 0.1% |
| 5079079 | 1 | < 0.1% |
| 5112872 | 1 | < 0.1% |
| 5105858 | 1 | < 0.1% |
| 5100411 | 1 | < 0.1% |
| 5022817 | 1 | < 0.1% |
| 5009811 | 1 | < 0.1% |
| 5113922 | 1 | < 0.1% |
| 5021541 | 1 | < 0.1% |
| Other values (29155) | 29155 |
| Value | Count | Frequency (%) |
| 5008804 | 1 | |
| 5008805 | 1 | |
| 5008806 | 1 | |
| 5008808 | 1 | |
| 5008810 | 1 | |
| 5008813 | 1 | |
| 5008814 | 1 | |
| 5008815 | 1 | |
| 5008819 | 1 | |
| 5008821 | 1 |
| Value | Count | Frequency (%) |
| 5150485 | 1 | |
| 5150482 | 1 | |
| 5150481 | 1 | |
| 5150480 | 1 | |
| 5150478 | 1 | |
| 5150477 | 1 | |
| 5150468 | 1 | |
| 5150465 | 1 | |
| 5150464 | 1 | |
| 5150463 | 1 |
Gender
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| F | |
|---|---|
| M |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 19549 | |
| M | 9616 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 19549 | |
| m | 9616 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 19549 | |
| M | 9616 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| F | 19549 | |
| M | 9616 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| F | 19549 | |
| M | 9616 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| F | 19549 | |
| M | 9616 |
| Value | Count | Frequency (%) |
| False | 18128 | |
| True | 11037 |
Has a property
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.6 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 19557 | |
| False | 9608 |
Children count
Real number (ℝ)
High correlation Zeros
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.43079033 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 20143 |
| Zeros (%) | 69.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.74188219 |
|---|---|
| Coefficient of variation (CV) | 1.7221422 |
| Kurtosis | 23.798772 |
| Mean | 0.43079033 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.5929624 |
| Sum | 12564 |
| Variance | 0.55038919 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20143 | |
| 1 | 6003 | 20.6% |
| 2 | 2624 | 9.0% |
| 3 | 323 | 1.1% |
| 4 | 52 | 0.2% |
| 5 | 15 | 0.1% |
| 7 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 19 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 20143 | |
| 1 | 6003 | 20.6% |
| 2 | 2624 | 9.0% |
| 3 | 323 | 1.1% |
| 4 | 52 | 0.2% |
| 5 | 15 | 0.1% |
| 7 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 19 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 19 | 1 | < 0.1% |
| 14 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 5 | 15 | 0.1% |
| 4 | 52 | 0.2% |
| 3 | 323 | 1.1% |
| 2 | 2624 | 9.0% |
| 1 | 6003 | 20.6% |
| 0 | 20143 |
Income
Real number (ℝ)
| Distinct | 259 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 186890.39 |
| Minimum | 27000 |
|---|---|
| Maximum | 1575000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | 27000 |
|---|---|
| 5-th percentile | 76500 |
| Q1 | 121500 |
| median | 157500 |
| Q3 | 225000 |
| 95-th percentile | 360000 |
| Maximum | 1575000 |
| Range | 1548000 |
| Interquartile range (IQR) | 103500 |
Descriptive statistics
| Standard deviation | 101409.64 |
|---|---|
| Coefficient of variation (CV) | 0.54261563 |
| Kurtosis | 18.289145 |
| Mean | 186890.39 |
| Median Absolute Deviation (MAD) | 45000 |
| Skewness | 2.7571154 |
| Sum | 5.4506581 × 109 |
| Variance | 1.0283916 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 135000 | 3468 | 11.9% |
| 180000 | 2487 | 8.5% |
| 157500 | 2469 | 8.5% |
| 225000 | 2373 | 8.1% |
| 112500 | 2359 | 8.1% |
| 202500 | 1781 | 6.1% |
| 90000 | 1395 | 4.8% |
| 270000 | 1344 | 4.6% |
| 315000 | 795 | 2.7% |
| 247500 | 686 | 2.4% |
| Other values (249) | 10008 |
| Value | Count | Frequency (%) |
| 27000 | 1 | < 0.1% |
| 29250 | 3 | < 0.1% |
| 30150 | 3 | < 0.1% |
| 31500 | 15 | |
| 31531.5 | 3 | < 0.1% |
| 32400 | 3 | < 0.1% |
| 33300 | 9 | |
| 33750 | 1 | < 0.1% |
| 36000 | 3 | < 0.1% |
| 36900 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 1575000 | 7 | < 0.1% |
| 1350000 | 5 | < 0.1% |
| 1125000 | 3 | < 0.1% |
| 990000 | 3 | < 0.1% |
| 945000 | 3 | < 0.1% |
| 900000 | 28 | |
| 810000 | 13 | |
| 787500 | 1 | < 0.1% |
| 765000 | 5 | < 0.1% |
| 742500 | 4 | < 0.1% |
Employment status
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| Working | |
|---|---|
| Commercial associate | |
| Pensioner | |
| State servant | |
| Student | 7 |
Length
| Max length | 20 |
|---|---|
| Median length | 7 |
| Mean length | 10.8587 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Working |
|---|---|
| 2nd row | Commercial associate |
| 3rd row | Commercial associate |
| 4th row | Commercial associate |
| 5th row | Working |
Common Values
| Value | Count | Frequency (%) |
| Working | 15056 | |
| Commercial associate | 6801 | |
| Pensioner | 4920 | 16.9% |
| State servant | 2381 | 8.2% |
| Student | 7 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| working | 15056 | |
| commercial | 6801 | |
| associate | 6801 | |
| pensioner | 4920 | 12.8% |
| state | 2381 | 6.2% |
| servant | 2381 | 6.2% |
| student | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 33578 | |
| i | 33578 | |
| r | 29158 | 9.2% |
| e | 28211 | 8.9% |
| n | 27284 | 8.6% |
| a | 25165 | 7.9% |
| s | 20903 | 6.6% |
| k | 15056 | 4.8% |
| W | 15056 | 4.8% |
| g | 15056 | 4.8% |
| Other values (11) | 73649 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 316694 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 33578 | |
| i | 33578 | |
| r | 29158 | 9.2% |
| e | 28211 | 8.9% |
| n | 27284 | 8.6% |
| a | 25165 | 7.9% |
| s | 20903 | 6.6% |
| k | 15056 | 4.8% |
| W | 15056 | 4.8% |
| g | 15056 | 4.8% |
| Other values (11) | 73649 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 316694 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 33578 | |
| i | 33578 | |
| r | 29158 | 9.2% |
| e | 28211 | 8.9% |
| n | 27284 | 8.6% |
| a | 25165 | 7.9% |
| s | 20903 | 6.6% |
| k | 15056 | 4.8% |
| W | 15056 | 4.8% |
| g | 15056 | 4.8% |
| Other values (11) | 73649 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 316694 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 33578 | |
| i | 33578 | |
| r | 29158 | 9.2% |
| e | 28211 | 8.9% |
| n | 27284 | 8.6% |
| a | 25165 | 7.9% |
| s | 20903 | 6.6% |
| k | 15056 | 4.8% |
| W | 15056 | 4.8% |
| g | 15056 | 4.8% |
| Other values (11) | 73649 |
Education level
Categorical
Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
| Secondary / secondary special | |
|---|---|
| Higher education | |
| Incomplete higher | 1129 |
| Lower secondary | 298 |
| Academic degree | 25 |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 24.85462 |
| Min length | 15 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Secondary / secondary special |
|---|---|
| 2nd row | Higher education |
| 3rd row | Secondary / secondary special |
| 4th row | Higher education |
| 5th row | Secondary / secondary special |
Common Values
| Value | Count | Frequency (%) |
| Secondary / secondary special | 19803 | |
| Higher education | 7910 | 27.1% |
| Incomplete higher | 1129 | 3.9% |
| Lower secondary | 298 | 1.0% |
| Academic degree | 25 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| secondary | 39904 | |
| 19803 | ||
| special | 19803 | |
| higher | 9039 | 9.2% |
| education | 7910 | 8.1% |
| incomplete | 1129 | 1.2% |
| lower | 298 | 0.3% |
| academic | 25 | < 0.1% |
| degree | 25 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 79312 | |
| c | 68796 | |
| 68771 | ||
| a | 67642 | |
| r | 49266 | 6.8% |
| o | 49241 | 6.8% |
| n | 48943 | 6.8% |
| d | 47864 | 6.6% |
| y | 39904 | 5.5% |
| s | 39904 | 5.5% |
| Other values (15) | 165242 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 724885 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 79312 | |
| c | 68796 | |
| 68771 | ||
| a | 67642 | |
| r | 49266 | 6.8% |
| o | 49241 | 6.8% |
| n | 48943 | 6.8% |
| d | 47864 | 6.6% |
| y | 39904 | 5.5% |
| s | 39904 | 5.5% |
| Other values (15) | 165242 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 724885 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 79312 | |
| c | 68796 | |
| 68771 | ||
| a | 67642 | |
| r | 49266 | 6.8% |
| o | 49241 | 6.8% |
| n | 48943 | 6.8% |
| d | 47864 | 6.6% |
| y | 39904 | 5.5% |
| s | 39904 | 5.5% |
| Other values (15) | 165242 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 724885 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 79312 | |
| c | 68796 | |
| 68771 | ||
| a | 67642 | |
| r | 49266 | 6.8% |
| o | 49241 | 6.8% |
| n | 48943 | 6.8% |
| d | 47864 | 6.6% |
| y | 39904 | 5.5% |
| s | 39904 | 5.5% |
| Other values (15) | 165242 |
Marital status
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Married | |
|---|---|
| Single / not married | |
| Civil marriage | |
| Separated | 1712 |
| Widow | 1233 |
Length
| Max length | 20 |
|---|---|
| Median length | 7 |
| Mean length | 9.3100977 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married |
|---|---|
| 2nd row | Single / not married |
| 3rd row | Married |
| 4th row | Single / not married |
| 5th row | Separated |
Common Values
| Value | Count | Frequency (%) |
| Married | 20044 | |
| Single / not married | 3864 | 13.2% |
| Civil marriage | 2312 | 7.9% |
| Separated | 1712 | 5.9% |
| Widow | 1233 | 4.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married | 23908 | |
| single | 3864 | 9.0% |
| 3864 | 9.0% | |
| not | 3864 | 9.0% |
| civil | 2312 | 5.4% |
| marriage | 2312 | 5.4% |
| separated | 1712 | 4.0% |
| widow | 1233 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 54152 | |
| i | 35941 | |
| e | 33508 | |
| a | 31956 | |
| d | 26853 | |
| M | 20044 | 7.4% |
| 13904 | 5.1% | |
| n | 7728 | 2.8% |
| l | 6176 | 2.3% |
| g | 6176 | 2.3% |
| Other values (10) | 35091 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 271529 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 54152 | |
| i | 35941 | |
| e | 33508 | |
| a | 31956 | |
| d | 26853 | |
| M | 20044 | 7.4% |
| 13904 | 5.1% | |
| n | 7728 | 2.8% |
| l | 6176 | 2.3% |
| g | 6176 | 2.3% |
| Other values (10) | 35091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 271529 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 54152 | |
| i | 35941 | |
| e | 33508 | |
| a | 31956 | |
| d | 26853 | |
| M | 20044 | 7.4% |
| 13904 | 5.1% | |
| n | 7728 | 2.8% |
| l | 6176 | 2.3% |
| g | 6176 | 2.3% |
| Other values (10) | 35091 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 271529 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 54152 | |
| i | 35941 | |
| e | 33508 | |
| a | 31956 | |
| d | 26853 | |
| M | 20044 | 7.4% |
| 13904 | 5.1% | |
| n | 7728 | 2.8% |
| l | 6176 | 2.3% |
| g | 6176 | 2.3% |
| Other values (10) | 35091 |
Dwelling
Categorical
Imbalance
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| House / apartment | |
|---|---|
| With parents | 1406 |
| Municipal apartment | 912 |
| Rented apartment | 453 |
| Office apartment | 208 |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 16.790125 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | With parents |
|---|---|
| 2nd row | House / apartment |
| 3rd row | House / apartment |
| 4th row | House / apartment |
| 5th row | House / apartment |
Common Values
| Value | Count | Frequency (%) |
| House / apartment | 26059 | |
| With parents | 1406 | 4.8% |
| Municipal apartment | 912 | 3.1% |
| Rented apartment | 453 | 1.6% |
| Office apartment | 208 | 0.7% |
| Co-op apartment | 127 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| apartment | 27759 | |
| house | 26059 | |
| 26059 | ||
| with | 1406 | 1.7% |
| parents | 1406 | 1.7% |
| municipal | 912 | 1.1% |
| rented | 453 | 0.5% |
| office | 208 | 0.2% |
| co-op | 127 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 58783 | |
| a | 57836 | |
| e | 56338 | |
| 55224 | ||
| n | 30530 | 6.2% |
| p | 30204 | 6.2% |
| r | 29165 | 6.0% |
| m | 27759 | 5.7% |
| s | 27465 | 5.6% |
| u | 26971 | 5.5% |
| Other values (15) | 89409 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 489684 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 58783 | |
| a | 57836 | |
| e | 56338 | |
| 55224 | ||
| n | 30530 | 6.2% |
| p | 30204 | 6.2% |
| r | 29165 | 6.0% |
| m | 27759 | 5.7% |
| s | 27465 | 5.6% |
| u | 26971 | 5.5% |
| Other values (15) | 89409 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 489684 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 58783 | |
| a | 57836 | |
| e | 56338 | |
| 55224 | ||
| n | 30530 | 6.2% |
| p | 30204 | 6.2% |
| r | 29165 | 6.0% |
| m | 27759 | 5.7% |
| s | 27465 | 5.6% |
| u | 26971 | 5.5% |
| Other values (15) | 89409 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 489684 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 58783 | |
| a | 57836 | |
| e | 56338 | |
| 55224 | ||
| n | 30530 | 6.2% |
| p | 30204 | 6.2% |
| r | 29165 | 6.0% |
| m | 27759 | 5.7% |
| s | 27465 | 5.6% |
| u | 26971 | 5.5% |
| Other values (15) | 89409 |
Age
Real number (ℝ)
| Distinct | 6794 |
|---|---|
| Distinct (%) | 23.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -15979.477 |
| Minimum | -25152 |
|---|---|
| Maximum | -7705 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 29165 |
| Negative (%) | 100.0% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | -25152 |
|---|---|
| 5-th percentile | -23021 |
| Q1 | -19444 |
| median | -15565 |
| Q3 | -12475 |
| 95-th percentile | -9873 |
| Maximum | -7705 |
| Range | 17447 |
| Interquartile range (IQR) | 6969 |
Descriptive statistics
| Standard deviation | 4202.9975 |
|---|---|
| Coefficient of variation (CV) | -0.26302471 |
| Kurtosis | -1.0433005 |
| Mean | -15979.477 |
| Median Absolute Deviation (MAD) | 3424 |
| Skewness | -0.18225185 |
| Sum | -4.6604146 × 108 |
| Variance | 17665188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -15519 | 44 | 0.2% |
| -12676 | 44 | 0.2% |
| -16896 | 33 | 0.1% |
| -16768 | 26 | 0.1% |
| -16053 | 26 | 0.1% |
| -14400 | 25 | 0.1% |
| -14122 | 24 | 0.1% |
| -14667 | 24 | 0.1% |
| -11126 | 24 | 0.1% |
| -22867 | 24 | 0.1% |
| Other values (6784) | 28871 |
| Value | Count | Frequency (%) |
| -25152 | 1 | < 0.1% |
| -25140 | 3 | |
| -25099 | 1 | < 0.1% |
| -25088 | 1 | < 0.1% |
| -25010 | 2 | |
| -24963 | 1 | < 0.1% |
| -24946 | 3 | |
| -24932 | 4 | |
| -24914 | 3 | |
| -24878 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -7705 | 1 | < 0.1% |
| -7723 | 1 | < 0.1% |
| -7757 | 3 | |
| -7959 | 2 | |
| -7980 | 1 | < 0.1% |
| -8041 | 4 | |
| -8054 | 1 | < 0.1% |
| -8056 | 2 | |
| -8069 | 1 | < 0.1% |
| -8076 | 2 |
Employment length
Real number (ℝ)
High correlation
| Distinct | 3483 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59257.761 |
| Minimum | -15713 |
|---|---|
| Maximum | 365243 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 24257 |
| Negative (%) | 83.2% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | -15713 |
|---|---|
| 5-th percentile | -7264 |
| Q1 | -3153 |
| median | -1557 |
| Q3 | -412 |
| 95-th percentile | 365243 |
| Maximum | 365243 |
| Range | 380956 |
| Interquartile range (IQR) | 2741 |
Descriptive statistics
| Standard deviation | 137655.88 |
|---|---|
| Coefficient of variation (CV) | 2.3230018 |
| Kurtosis | 1.1433571 |
| Mean | 59257.761 |
| Median Absolute Deviation (MAD) | 1309 |
| Skewness | 1.7724256 |
| Sum | 1.7282526 × 109 |
| Variance | 1.8949142 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 365243 | 4908 | 16.8% |
| -401 | 61 | 0.2% |
| -200 | 55 | 0.2% |
| -2087 | 53 | 0.2% |
| -1539 | 51 | 0.2% |
| -1678 | 47 | 0.2% |
| -1081 | 47 | 0.2% |
| -2531 | 46 | 0.2% |
| -1160 | 45 | 0.2% |
| -309 | 44 | 0.2% |
| Other values (3473) | 23808 |
| Value | Count | Frequency (%) |
| -15713 | 1 | < 0.1% |
| -15661 | 3 | < 0.1% |
| -15227 | 1 | < 0.1% |
| -15072 | 2 | < 0.1% |
| -15038 | 13 | |
| -14887 | 6 | |
| -14810 | 6 | |
| -14775 | 2 | < 0.1% |
| -14536 | 4 | < 0.1% |
| -14473 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 365243 | 4908 | |
| -17 | 2 | < 0.1% |
| -65 | 1 | < 0.1% |
| -66 | 1 | < 0.1% |
| -70 | 2 | < 0.1% |
| -71 | 1 | < 0.1% |
| -73 | 14 | < 0.1% |
| -78 | 1 | < 0.1% |
| -79 | 1 | < 0.1% |
| -88 | 1 | < 0.1% |
Has a mobile phone
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 1 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 29165 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 29165 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 29165 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 29165 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 29165 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 29165 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 22623 | |
| 1 | 6542 | 22.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 22623 | |
| 1 | 6542 | 22.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 22623 | |
| 1 | 6542 | 22.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 22623 | |
| 1 | 6542 | 22.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 22623 | |
| 1 | 6542 | 22.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 22623 | |
| 1 | 6542 | 22.4% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 20562 | |
| 1 | 8603 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 20562 | |
| 1 | 8603 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 20562 | |
| 1 | 8603 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 20562 | |
| 1 | 8603 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 20562 | |
| 1 | 8603 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 20562 | |
| 1 | 8603 |
Has an email
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 0 | |
|---|---|
| 1 | 2633 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 26532 | |
| 1 | 2633 | 9.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 26532 | |
| 1 | 2633 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 26532 | |
| 1 | 2633 | 9.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 26532 | |
| 1 | 2633 | 9.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 26532 | |
| 1 | 2633 | 9.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 26532 | |
| 1 | 2633 | 9.0% |
Job title
Categorical
High correlation Missing
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 9027 |
| Missing (%) | 31.0% |
| Memory size | 1.6 MiB |
| Laborers | |
|---|---|
| Core staff | |
| Sales staff | |
| Managers | |
| Drivers | |
| Other values (13) |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 10.533916 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Core staff |
|---|---|
| 2nd row | Accountants |
| 3rd row | Laborers |
| 4th row | Managers |
| 5th row | Accountants |
Common Values
| Value | Count | Frequency (%) |
| Laborers | 5004 | |
| Core staff | 2866 | 9.8% |
| Sales staff | 2773 | 9.5% |
| Managers | 2422 | 8.3% |
| Drivers | 1722 | 5.9% |
| High skill tech staff | 1133 | 3.9% |
| Accountants | 998 | 3.4% |
| Medicine staff | 956 | 3.3% |
| Cooking staff | 521 | 1.8% |
| Security staff | 464 | 1.6% |
| Other values (8) | 1279 | 4.4% |
| (Missing) | 9027 |
Length
| Value | Count | Frequency (%) |
| staff | 9672 | |
| laborers | 5142 | |
| core | 2866 | 8.8% |
| sales | 2773 | 8.5% |
| managers | 2422 | 7.4% |
| drivers | 1722 | 5.3% |
| high | 1133 | 3.5% |
| skill | 1133 | 3.5% |
| tech | 1133 | 3.5% |
| accountants | 998 | 3.1% |
| Other values (13) | 3567 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 24637 | |
| s | 24596 | |
| r | 20552 | |
| e | 20460 | |
| f | 19344 | 9.1% |
| t | 13921 | 6.6% |
| 12423 | 5.9% | |
| o | 10186 | 4.8% |
| i | 8271 | 3.9% |
| n | 6932 | 3.3% |
| Other values (26) | 50810 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 212132 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 24637 | |
| s | 24596 | |
| r | 20552 | |
| e | 20460 | |
| f | 19344 | 9.1% |
| t | 13921 | 6.6% |
| 12423 | 5.9% | |
| o | 10186 | 4.8% |
| i | 8271 | 3.9% |
| n | 6932 | 3.3% |
| Other values (26) | 50810 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 212132 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 24637 | |
| s | 24596 | |
| r | 20552 | |
| e | 20460 | |
| f | 19344 | 9.1% |
| t | 13921 | 6.6% |
| 12423 | 5.9% | |
| o | 10186 | 4.8% |
| i | 8271 | 3.9% |
| n | 6932 | 3.3% |
| Other values (26) | 50810 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 212132 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 24637 | |
| s | 24596 | |
| r | 20552 | |
| e | 20460 | |
| f | 19344 | 9.1% |
| t | 13921 | 6.6% |
| 12423 | 5.9% | |
| o | 10186 | 4.8% |
| i | 8271 | 3.9% |
| n | 6932 | 3.3% |
| Other values (26) | 50810 |
Family member count
Real number (ℝ)
High correlation
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1975313 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.91218872 |
|---|---|
| Coefficient of variation (CV) | 0.41509704 |
| Kurtosis | 8.6454749 |
| Mean | 2.1975313 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.3103351 |
| Sum | 64091 |
| Variance | 0.83208827 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 15552 | |
| 1 | 5613 | 19.2% |
| 3 | 5121 | 17.6% |
| 4 | 2503 | 8.6% |
| 5 | 309 | 1.1% |
| 6 | 48 | 0.2% |
| 7 | 14 | < 0.1% |
| 9 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 5613 | 19.2% |
| 2 | 15552 | |
| 3 | 5121 | 17.6% |
| 4 | 2503 | 8.6% |
| 5 | 309 | 1.1% |
| 6 | 48 | 0.2% |
| 7 | 14 | < 0.1% |
| 9 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 7 | 14 | < 0.1% |
| 6 | 48 | 0.2% |
| 5 | 309 | 1.1% |
| 4 | 2503 | 8.6% |
| 3 | 5121 | 17.6% |
| 2 | 15552 | |
| 1 | 5613 | 19.2% |
Account age_x
Real number (ℝ)
High correlation
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -26.137734 |
| Minimum | -60 |
|---|---|
| Maximum | 0 |
| Zeros | 247 |
| Zeros (%) | 0.8% |
| Negative | 28918 |
| Negative (%) | 99.2% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | -60 |
|---|---|
| 5-th percentile | -55 |
| Q1 | -39 |
| median | -24 |
| Q3 | -12 |
| 95-th percentile | -3 |
| Maximum | 0 |
| Range | 60 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 16.486702 |
|---|---|
| Coefficient of variation (CV) | -0.63076248 |
| Kurtosis | -1.0342853 |
| Mean | -26.137734 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.28850885 |
| Sum | -762307 |
| Variance | 271.81133 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -7 | 690 | 2.4% |
| -6 | 669 | 2.3% |
| -17 | 659 | 2.3% |
| -5 | 656 | 2.2% |
| -8 | 655 | 2.2% |
| -10 | 645 | 2.2% |
| -11 | 642 | 2.2% |
| -16 | 642 | 2.2% |
| -9 | 629 | 2.2% |
| -12 | 628 | 2.2% |
| Other values (51) | 22650 |
| Value | Count | Frequency (%) |
| -60 | 249 | |
| -59 | 250 | |
| -58 | 270 | |
| -57 | 244 | |
| -56 | 278 | |
| -55 | 285 | |
| -54 | 281 | |
| -53 | 304 | |
| -52 | 367 | |
| -51 | 385 |
| Value | Count | Frequency (%) |
| 0 | 247 | 0.8% |
| -1 | 444 | |
| -2 | 519 | |
| -3 | 626 | |
| -4 | 625 | |
| -5 | 656 | |
| -6 | 669 | |
| -7 | 690 | |
| -8 | 655 | |
| -9 | 629 |
Is high risk
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 0 | |
|---|---|
| 1 | 499 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 28666 | |
| 1 | 499 | 1.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 28666 | |
| 1 | 499 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 28666 | |
| 1 | 499 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 28666 | |
| 1 | 499 | 1.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 28666 | |
| 1 | 499 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29165 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 28666 | |
| 1 | 499 | 1.7% |
Account age_y
Real number (ℝ)
High correlation
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -26.137734 |
| Minimum | -60 |
|---|---|
| Maximum | 0 |
| Zeros | 247 |
| Zeros (%) | 0.8% |
| Negative | 28918 |
| Negative (%) | 99.2% |
| Memory size | 228.0 KiB |
Quantile statistics
| Minimum | -60 |
|---|---|
| 5-th percentile | -55 |
| Q1 | -39 |
| median | -24 |
| Q3 | -12 |
| 95-th percentile | -3 |
| Maximum | 0 |
| Range | 60 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 16.486702 |
|---|---|
| Coefficient of variation (CV) | -0.63076248 |
| Kurtosis | -1.0342853 |
| Mean | -26.137734 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.28850885 |
| Sum | -762307 |
| Variance | 271.81133 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -7 | 690 | 2.4% |
| -6 | 669 | 2.3% |
| -17 | 659 | 2.3% |
| -5 | 656 | 2.2% |
| -8 | 655 | 2.2% |
| -10 | 645 | 2.2% |
| -11 | 642 | 2.2% |
| -16 | 642 | 2.2% |
| -9 | 629 | 2.2% |
| -12 | 628 | 2.2% |
| Other values (51) | 22650 |
| Value | Count | Frequency (%) |
| -60 | 249 | |
| -59 | 250 | |
| -58 | 270 | |
| -57 | 244 | |
| -56 | 278 | |
| -55 | 285 | |
| -54 | 281 | |
| -53 | 304 | |
| -52 | 367 | |
| -51 | 385 |
| Value | Count | Frequency (%) |
| 0 | 247 | 0.8% |
| -1 | 444 | |
| -2 | 519 | |
| -3 | 626 | |
| -4 | 625 | |
| -5 | 656 | |
| -6 | 669 | |
| -7 | 690 | |
| -8 | 655 | |
| -9 | 629 |
Interactions
Correlations
| Account age_x | Account age_y | Age | Children count | Dwelling | Education level | Employment length | Employment status | Family member count | Gender | Has a car | Has a phone | Has a property | Has a work phone | Has an email | ID | Income | Is high risk | Job title | Marital status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Account age_x | 1.000 | 1.000 | 0.057 | -0.005 | 0.013 | 0.012 | 0.077 | 0.015 | -0.027 | 0.013 | 0.044 | 0.026 | 0.014 | 0.024 | 0.018 | -0.002 | -0.026 | 0.064 | 0.025 | 0.030 |
| Account age_y | 1.000 | 1.000 | 0.057 | -0.005 | 0.013 | 0.012 | 0.077 | 0.015 | -0.027 | 0.013 | 0.044 | 0.026 | 0.014 | 0.024 | 0.018 | -0.002 | -0.026 | 0.064 | 0.025 | 0.030 |
| Age | 0.057 | 0.057 | 1.000 | 0.379 | 0.111 | 0.123 | -0.209 | 0.379 | 0.304 | 0.208 | 0.163 | 0.066 | 0.136 | 0.203 | 0.108 | 0.053 | 0.095 | 0.018 | 0.096 | 0.167 |
| Children count | -0.005 | -0.005 | 0.379 | 1.000 | 0.030 | 0.017 | -0.142 | 0.071 | 0.825 | 0.063 | 0.086 | 0.020 | 0.007 | 0.056 | 0.004 | 0.027 | 0.043 | 0.000 | 0.058 | 0.078 |
| Dwelling | 0.013 | 0.013 | 0.111 | 0.030 | 1.000 | 0.051 | 0.114 | 0.062 | 0.067 | 0.083 | 0.041 | 0.040 | 0.204 | 0.039 | 0.027 | 0.032 | 0.052 | 0.008 | 0.072 | 0.055 |
| Education level | 0.012 | 0.012 | 0.123 | 0.017 | 0.051 | 1.000 | 0.147 | 0.096 | 0.028 | 0.014 | 0.106 | 0.057 | 0.042 | 0.046 | 0.095 | 0.042 | 0.109 | 0.005 | 0.204 | 0.047 |
| Employment length | 0.077 | 0.077 | -0.209 | -0.142 | 0.114 | 0.147 | 1.000 | 0.998 | -0.145 | 0.175 | 0.154 | 0.010 | 0.096 | 0.242 | 0.086 | -0.008 | -0.162 | 0.000 | 1.000 | 0.211 |
| Employment status | 0.015 | 0.015 | 0.379 | 0.071 | 0.062 | 0.096 | 0.998 | 1.000 | 0.120 | 0.190 | 0.159 | 0.012 | 0.098 | 0.254 | 0.109 | 0.047 | 0.099 | 0.012 | 0.178 | 0.108 |
| Family member count | -0.027 | -0.027 | 0.304 | 0.825 | 0.067 | 0.028 | -0.145 | 0.120 | 1.000 | 0.105 | 0.115 | 0.025 | 0.018 | 0.055 | 0.027 | 0.026 | 0.024 | 0.006 | 0.059 | 0.155 |
| Gender | 0.013 | 0.013 | 0.208 | 0.063 | 0.083 | 0.014 | 0.175 | 0.190 | 0.105 | 1.000 | 0.360 | 0.029 | 0.047 | 0.061 | 0.000 | 0.050 | 0.201 | 0.015 | 0.558 | 0.164 |
| Has a car | 0.044 | 0.044 | 0.163 | 0.086 | 0.041 | 0.106 | 0.154 | 0.159 | 0.115 | 0.360 | 1.000 | 0.010 | 0.011 | 0.017 | 0.016 | 0.058 | 0.206 | 0.000 | 0.272 | 0.152 |
| Has a phone | 0.026 | 0.026 | 0.066 | 0.020 | 0.040 | 0.057 | 0.010 | 0.012 | 0.025 | 0.029 | 0.010 | 1.000 | 0.065 | 0.312 | 0.010 | 0.065 | 0.046 | 0.000 | 0.067 | 0.042 |
| Has a property | 0.014 | 0.014 | 0.136 | 0.007 | 0.204 | 0.042 | 0.096 | 0.098 | 0.018 | 0.047 | 0.011 | 0.065 | 1.000 | 0.210 | 0.052 | 0.184 | 0.041 | 0.025 | 0.048 | 0.033 |
| Has a work phone | 0.024 | 0.024 | 0.203 | 0.056 | 0.039 | 0.046 | 0.242 | 0.254 | 0.055 | 0.061 | 0.017 | 0.312 | 0.210 | 1.000 | 0.035 | 0.127 | 0.035 | 0.000 | 0.062 | 0.068 |
| Has an email | 0.018 | 0.018 | 0.108 | 0.004 | 0.027 | 0.095 | 0.086 | 0.109 | 0.027 | 0.000 | 0.016 | 0.010 | 0.052 | 0.035 | 1.000 | 0.165 | 0.091 | 0.000 | 0.089 | 0.029 |
| ID | -0.002 | -0.002 | 0.053 | 0.027 | 0.032 | 0.042 | -0.008 | 0.047 | 0.026 | 0.050 | 0.058 | 0.065 | 0.184 | 0.127 | 0.165 | 1.000 | -0.022 | 0.016 | 0.064 | 0.042 |
| Income | -0.026 | -0.026 | 0.095 | 0.043 | 0.052 | 0.109 | -0.162 | 0.099 | 0.024 | 0.201 | 0.206 | 0.046 | 0.041 | 0.035 | 0.091 | -0.022 | 1.000 | 0.000 | 0.112 | 0.032 |
| Is high risk | 0.064 | 0.064 | 0.018 | 0.000 | 0.008 | 0.005 | 0.000 | 0.012 | 0.006 | 0.015 | 0.000 | 0.000 | 0.025 | 0.000 | 0.000 | 0.016 | 0.000 | 1.000 | 0.028 | 0.022 |
| Job title | 0.025 | 0.025 | 0.096 | 0.058 | 0.072 | 0.204 | 1.000 | 0.178 | 0.059 | 0.558 | 0.272 | 0.067 | 0.048 | 0.062 | 0.089 | 0.064 | 0.112 | 0.028 | 1.000 | 0.108 |
| Marital status | 0.030 | 0.030 | 0.167 | 0.078 | 0.055 | 0.047 | 0.211 | 0.108 | 0.155 | 0.164 | 0.152 | 0.042 | 0.033 | 0.068 | 0.029 | 0.042 | 0.032 | 0.022 | 0.108 | 1.000 |
Missing values
Sample
| ID | Gender | Has a car | Has a property | Children count | Income | Employment status | Education level | Marital status | Dwelling | Age | Employment length | Has a mobile phone | Has a work phone | Has a phone | Has an email | Job title | Family member count | Account age_x | Is high risk | Account age_y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5037048 | M | Y | Y | 0 | 135000.0 | Working | Secondary / secondary special | Married | With parents | -16271 | -3111 | 1 | 0 | 0 | 0 | Core staff | 2.0 | -17.0 | 0 | -17 |
| 1 | 5044630 | F | Y | N | 1 | 135000.0 | Commercial associate | Higher education | Single / not married | House / apartment | -10130 | -1651 | 1 | 0 | 0 | 0 | Accountants | 2.0 | -1.0 | 0 | -1 |
| 2 | 5079079 | F | N | Y | 2 | 180000.0 | Commercial associate | Secondary / secondary special | Married | House / apartment | -12821 | -5657 | 1 | 0 | 0 | 0 | Laborers | 4.0 | -38.0 | 0 | -38 |
| 3 | 5112872 | F | Y | Y | 0 | 360000.0 | Commercial associate | Higher education | Single / not married | House / apartment | -20929 | -2046 | 1 | 0 | 0 | 1 | Managers | 1.0 | -11.0 | 0 | -11 |
| 4 | 5105858 | F | N | N | 0 | 270000.0 | Working | Secondary / secondary special | Separated | House / apartment | -16207 | -515 | 1 | 0 | 1 | 0 | NaN | 1.0 | -41.0 | 0 | -41 |
| 5 | 5100411 | F | Y | Y | 0 | 135000.0 | Working | Secondary / secondary special | Married | House / apartment | -13251 | -3839 | 1 | 1 | 0 | 0 | Accountants | 2.0 | -1.0 | 0 | -1 |
| 6 | 5022817 | M | Y | Y | 0 | 202500.0 | Working | Secondary / secondary special | Married | House / apartment | -17262 | -1617 | 1 | 0 | 0 | 0 | Core staff | 2.0 | -16.0 | 0 | -16 |
| 7 | 5009811 | F | N | N | 1 | 202500.0 | Working | Secondary / secondary special | Married | House / apartment | -11813 | -3266 | 1 | 1 | 1 | 0 | Sales staff | 3.0 | -21.0 | 0 | -21 |
| 8 | 5113922 | F | N | N | 0 | 90000.0 | Pensioner | Secondary / secondary special | Single / not married | Municipal apartment | -23478 | 365243 | 1 | 0 | 0 | 0 | NaN | 1.0 | -50.0 | 0 | -50 |
| 9 | 5021541 | F | Y | N | 1 | 306000.0 | Working | Higher education | Married | House / apartment | -9310 | -1678 | 1 | 0 | 0 | 0 | NaN | 3.0 | -13.0 | 0 | -13 |
| ID | Gender | Has a car | Has a property | Children count | Income | Employment status | Education level | Marital status | Dwelling | Age | Employment length | Has a mobile phone | Has a work phone | Has a phone | Has an email | Job title | Family member count | Account age_x | Is high risk | Account age_y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29155 | 5021871 | F | Y | Y | 1 | 315000.0 | State servant | Higher education | Widow | House / apartment | -18233 | -425 | 1 | 0 | 1 | 1 | NaN | 2.0 | -30.0 | 0 | -30 |
| 29156 | 5009779 | M | N | N | 0 | 135000.0 | Working | Secondary / secondary special | Separated | House / apartment | -14118 | -3174 | 1 | 0 | 0 | 0 | Laborers | 1.0 | -4.0 | 0 | -4 |
| 29157 | 5010913 | F | Y | Y | 0 | 81000.0 | Pensioner | Higher education | Married | House / apartment | -20399 | 365243 | 1 | 0 | 0 | 0 | NaN | 2.0 | -43.0 | 0 | -43 |
| 29158 | 5065502 | F | Y | N | 1 | 135000.0 | Working | Higher education | Married | Municipal apartment | -12523 | -2482 | 1 | 0 | 0 | 0 | Managers | 3.0 | -13.0 | 0 | -13 |
| 29159 | 5091339 | F | N | Y | 0 | 135000.0 | Commercial associate | Secondary / secondary special | Married | House / apartment | -11088 | -1447 | 1 | 0 | 1 | 0 | Cooking staff | 2.0 | -3.0 | 0 | -3 |
| 29160 | 5067139 | F | N | Y | 0 | 112500.0 | Pensioner | Secondary / secondary special | Single / not married | House / apartment | -23400 | 365243 | 1 | 0 | 1 | 1 | NaN | 1.0 | -5.0 | 0 | -5 |
| 29161 | 5029193 | F | N | Y | 1 | 135000.0 | Commercial associate | Secondary / secondary special | Married | House / apartment | -15532 | -8256 | 1 | 0 | 0 | 0 | Core staff | 3.0 | -24.0 | 0 | -24 |
| 29162 | 5047710 | F | N | Y | 0 | 76500.0 | Working | Secondary / secondary special | Married | House / apartment | -17782 | -3291 | 1 | 1 | 1 | 0 | Managers | 2.0 | -29.0 | 0 | -29 |
| 29163 | 5009886 | F | N | Y | 0 | 157500.0 | Pensioner | Secondary / secondary special | Civil marriage | House / apartment | -21635 | 365243 | 1 | 0 | 1 | 0 | NaN | 2.0 | -37.0 | 0 | -37 |
| 29164 | 5062632 | F | N | Y | 0 | 585000.0 | Commercial associate | Secondary / secondary special | Married | House / apartment | -18858 | -2010 | 1 | 0 | 1 | 0 | NaN | 2.0 | -43.0 | 0 | -43 |